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Abstract 

The human immunodeficiency virus type 1 (HIV-1) subtype G is the second most prevalent HIV-1 clade in West Africa, 
accounting for nearly 30% of infections in the region. There is no information about the spatiotemporal dynamics of 
dissemination of this HIV-1 clade in Africa. To this end, we analyzed a total of 305 HIV-1 subtype G pol sequences isolated 
from 11 different countries from West and Central Africa over a period of 20 years (1992 to 2011). Evolutionary, 
phylogeographic and demographic parameters were jointly estimated from sequence data using a Bayesian coalescent- 
based method. Our analyses indicate that subtype G most probably emerged in Central Africa in 1968 (1956-1976). From 
Central Africa, the virus was disseminated to West and West Central Africa at multiple times from the middle 1 970s onwards. 
Two subtype G strains probably introduced into Nigeria and Togo between the middle and the late 1970s were 
disseminated locally and to neighboring countries, leading to the origin of two major western African clades (G WA _| and G WA . 
n). Subtype G clades circulating in western and central African regions displayed an initial phase of exponential growth 
followed by a decline in growth rate since the early/middle 1990s; but the mean epidemic growth rate of G WA .| (0.75 year -1 ) 
and G WA .n (0.95 year -1 ) clades was about two times higher than that estimated for central African lineages (0.47 year -1 ). 
Notably, the overall evolutionary and demographic history of G WA .| and G WA . M clades was very similar to that estimated for 
the CRF06_cpx clade circulating in the same region. These results support the notion that the spatiotemporal dissemination 
dynamics of major HIV-1 clades circulating in western Africa have probably been shaped by the same ecological factors. 
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Introduction 

The current distribution of human immunodeficiency virus type 
1 (HIV-1) group M subtypes and circulating recombinant forms 
(CRFs) around the world resulted from the chance exportation of 
different viral strains out of Central Africa into new geographic 
regions were these initiated secondary epidemics [1]. A recent 
study suggests that spatial accessibility (human migrations and 
movements through transportation link availability and quality) 
has played a significant role in HIV-1 spread across sub-Saharan 
Africa and may explain the heterogeneous distribution of HIV- 1 
subtypes and CRFs in the different African regions [2] . 

West Africa is one of the most strongly connected regions in 
the continent [2] and also appears as an area of intense intra- 
regional migration [3] . This coincides with an overall dominance 
of the CRF02_AG variant, that accounts for about 50% of all 
HIV-1 infections in West Africa [4]. A closer inspection of the 
HIV-1 molecular epidemiological profile in this African region, 
however, reveals an important intra-regional heterogeneity in the 
distribution of other viral clades, including subtype G and 
CRF06_cpx. Subtype G is the second most prevalent HIV-1 
clade in West Africa accounting for nearly 30% of infections in 
the region [4]. Its prevalence greatly varies within and between 



countries, comprising 30-50% of HIV-1 infections across diffe- 
rent regions from Nigeria [5,6,7,8,9,10,11,12], 5—15% in Benin, 
Niger and Togo [13,14,15,16,17], and £4% in other western 
African countries [14,18,19,20,21,22,23,24,25,26,27,28,29,30]. 
Similarly, the occurrence of the CRF06_cpx clade ranges from 
40-50% of HIV-1 infections in Burkina Faso [18,19,20], to 5- 
15% in Benin, Ghana, Mali, Niger, Nigeria, Senegal and Togo 
[5,6,7,8,9,10,11,12,13,14,15,16,17,21,22,23,24,28,29], and <3% 
in other western African countries [14,26,27,30]. 

The highly heterogeneous distribution of subtype G and 
CRF06_cpx across the well-connected western African countries, 
suggests that spatial accessibility is not enough to fully explain the 
spatial distribution of those HIV- 1 clades in this African region. A 
recent study conducted by our group suggests that Burkina Faso 
was the most important epicenter of dissemination of the HIV- 1 
CRF06_cpx strain at regional level and that CRF06_cpx 
prevalence decreases exponentially as we move away from the 
epicenter [31]. Our study also estimated that the CRF06_cpx 
clade started to spread in West Africa around the late 1970s [31], 
almost 10 years later than the estimated origin of the CRF02_AG 
clade in West Central Africa [32]. We postulated that the relatively 
late introduction of the CRF06_cpx clade into western Africa 
combined with the stabilization of the HIV epidemic in several 
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countries from the region since the early/ middle 1 990s may have 
resulted in a more limited dissemination away from the epicenter 
and a more heterogeneous regional distribution of CRF06_cpx 
when compared with CRF02_AG. 

It is unclear whether this hypothesis could also explain the 
complex distribution of subtype G in West Africa. The objective of 
this study was to reconstruct the onset date, dissemination routes 
and demographic history of the HIV-1 subtype G clade in the 
African continent. To this end, we used a Bayesian coalescent- 
based framework to analyze 305 HIV-1 subtype G pol sequences 
isolated from 1 1 different countries from West (Benin, Ghana, 
Nigeria, Senegal and Togo), West Central (Cameroon, Equatorial 
Guinea and Gabon), and Central Africa (Angola, Democratic 
Republic of Congo and Republic of Congo) over a period of 20 
years (1992 to 2011). 

Materials and Methods 

Sequence Dataset 

All HIV-1 subtype G pol sequences from West and Central 
African countries that covered the entire protease and partial 
reverse transcriptase (PR/RT) regions (nt 2253-3272 relative to 
HXB2 clone) and for which the sampling year was known, were 
downloaded from the Los Alamos HIV Sequence Database (www. 
hiv.lanl.gov) by August 2013. The subtype assignment of all 
sequences was confirmed by: REGA HIV subtyping tool v. 2 [33], 
Maximum Likelihood (ML) phylogenetic analysis, and boot- 
scanning analysis. A ML phylogeny with HIV-1 group M subtype 
reference sequences was constructed with the PhyML 3.0 program 
[34] using an online web server [35]. The ML tree was inferred 
under the GTR+I+G nucleotide substitution model recommended 
by the jModeltest program [36]. The heuristic tree search was 
performed using the SPR branch-swapping algorithm and branch 
support was calculated with the approximate likelihood-ratio 
(aLRT) SH-like test [37]. In bootscanning analyses, supporting 
branching of query sequences with HIV-1 group M subtypes 
reference sequences was determined in Neighbor-Joining trees 
constructed with the Kimura two-parameter model, within a 
250 bp window moving in steps of 10 bases, using Simplot 
software v.3.5.1 [38]. We detected that 4.7% of the subtype G pol 
sequences available in database had incorrect subtype classifica- 
tion, consistent with previous estimations [39]. Sequences with 
incorrect classification, multiple sequences from the same individ- 
ual and sequences from countries poorly represented (n<4 
sequences) were removed, resulting in a final data set of 305 
HIV-1 subtype G pol African sequences (Table 1). All codon 
positions known to be associated with major antiretroviral drug 
resistance were maintained in the final alignment because ML 
trees constructed on alignments with or without such positions 
resulted in the same overall topology (data not shown). Final 
sequence alignment is available from the authors upon request. 

Analysis of Spatiotemporal Dispersion Pattern and 
Demographic History 

The evolutionary rate (fl, nucleotide substitutions per site per 
year, subst. /site/year), the age of the most recent common 
ancestor (Tmrca> years), the ancestral geographic movements, and 
the mode and rate (r, years- 1) of population growth of HIV-1 
subtype G clades circulating in Africa were joindy estimated using 
the Bayesian Markov Chain Monte Carlo (MCMC) approach as 
implemented in BEAST vl.8 [40,41] with BEAGLE to improve 
run-time [42]. Analyses were performed under a GTR+I+G 
nucleotide substitution model. The temporal scale of evolutionary 
process was estimated from the sampling dates of the sequences 



using a relaxed uncorrelated lognormal molecular clock model 
and a uniform prior on clock rate (1.0-4.0xl0 _;! subst/site/year) 
[43]. Migration events throughout the phylogenetic history were 
inferred using a reversible discrete Bayesian phylogeographic 
model [44], in which all possible reversible exchange rates 
between locations were equally likely, and a CTMC rate reference 
prior [45]. To quantify the dissemination process, we estimated 
the number of viral migrations among locations using 'Markov 
Jump' counts [46] of location-state transitions along the posterior 
tree distribution as previously described [47,48]. Changes in 
effective population size through time were initially estimated 
using a flexible Bayesian Skyline coalescent model [49] that does 
not require strong prior assumptions of demographic history. 
Estimates of the population growth rate were subsequendy 
obtained using the parametric model (logistic, exponential or 
expansion) that provided the best fit to the demographic signal 
contained in datasets. Comparison between demographic models 
was performed using the log marginal likelihood (ML) estimation 
based on path sampling (PS) and stepping-stone sampling (SS) 
methods [50]. MCMC chains were run for 50-500 xl0 b 
generations. Adequate chain mixing and uncertainty in parameter 
estimates were assessed by calculating the Effective Sample Size 
(ESS) and the 95% Highest Probability Density (HPD) values, 
respectively, using the TRACER vl.6 program [51]. Maximum 
clade credibility (MCC) trees were summarized from the posterior 
distribution of trees with TreeAnnotator and visualized with 
FigTree v 1.4.0 [52]. Migratory events across time were summa- 
rized using the cross-platform SPREAD application [53]. 

Results 

Origin of the HIV-1 Subtype G and Identification of Major 
African Clades 

We analyzed 305 HIV- 1 subtype G pol sequences isolated from 
11 African countries between 1992 and 2011 that were sampled 
across seven different location states (Table 1). Neighboring 
countries from West (Togo/Ghana), West Central (Gabon/ 
Equatorial Guinea) and Central (Angola/Democratic Republic 
of Congo/Republic of Congo) Africa comprising few samples («< 
15) were grouped into the same location (Table 1). According to 
the Bayesian MCMC analysis, the median evolutionary rate of the 
HIV-1 subtype G lineage at pol gene was estimated at 2.3x10 3 
(95% HPD: 1.8xl0" 3 -2.8xl0" 3 ) subst. /site/year. The estimat- 
ed coefficient of rate variation in our dataset was 0.28 (95% HPD: 
0.24—0.32), thus supporting a significant variation of substitution 
rate among branches and the use of a relaxed molecular clock 
model. The most probable root location of the subtype G clade 
was placed in Central Africa (posterior state probability, 
PSP= 0.88), and the onset date of this clade was estimated to be 
1968 (95% HPD: 1956-1976) (Fig. 1). 

The Bayesian MCC (Fig. 1) and ML (Fig. SI) trees point to a 
clear phylogeographic subdivision of subtype G strains from West 
and Central Africa. Sequences from western Africa branched 
mostly in two large monophyletic clades (Gwa-i an d Gwa-ii) that 
were nested among the most basal clades from Central and West 
Central Africa (Gca)- Distribution of HIV-1 subtype G clades 
greatly varies across countries within each region (Fig. 2). The 
Gwa-i clade was the predominant subtype G lineage detected in 
Nigeria (80%) and the Gwa-ii clade predominates in Togo/Ghana 
(86%). The subtype G epidemic in Benin is dominated by both 
Gwa-i (47%) and Gwa-ii (40%) clades, whereas G W a-i (50%) and 
Gca (42%) clades prevail among subtype G infections in Senegal. 
Basal G GA clades predominate in countries from both central 
(100%) and west central (50-71%) regions. 
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Table 1. HIV-1 subtype G pol dataset. 





Region 


Country 


Location 3 


N 


Sampling interval 


West Africa 


Benin 


BJ 


15 


2004-2009 




Ghana 


TG/GH 


8 


2002-2009 




Nigeria 


NG 


183 


1992-2010 




Senegal 


SN 


12 


1998-2010 




Togo 


TG/GH 


13 


2006-2008 


West Central Africa 


Cameroon 


CM 


31 


1997-2011 




Gabon 


GA/GQ 


6 


2000-2008 




Equatorial Guinea 


GA/GQ 


4 


2005-2009 


Central Africa 


Angola 


AO/CD/CG 


13 


1997-2010 




DRC 


AO/CD/CG 


12 


1993-2007 




Republic of Congo 


AO/CD/CG 


8 


2003 



a Location assigned in the Bayesian phylogeographic analysis. DRC: Democratic Republic of Congo. 
doi:1 0.1 371 /journal.pone.0098908.t001 



Spatiotemporal Dispersal Pattern of the HIV-1 Subtype G 
African's Epidemic 

Reconstruction of viral migrations across time revealed the 
occurrence of multiple introductions of HIV-1 subtype G strains 
from Central into West Africa since the middle 1970s (Fig. 3). The 
earliest viral migrations led to the origin of the Gwa-i an d G\va-ii 
lineages. The G WA j clade most probably emerged in Nigeria 
(PSP-l) around 1974 (95% HPD: 1966-1981) and from this 
country was later disseminated to Benin, Cameroon, Equatorial 
Guinea, Ghana, and Senegal. The G W a-ii clade most probably 
emerged in Togo/Ghana (PSP= 0.68) around 1979 (95% HPD: 
1973-1984) and was disseminated to Nigeria in 1981 (95% HPD: 
1976-1986), where it further spread locally. In the following years, 
the Gwa-ii clade was disseminated from both Togo/Ghana and 
Nigeria to Benin, Cameroon, Gabon, and Senegal. Our 
phylogeographic analysis also detected several independent 
introductions of subtype G variants from Central Africa into 
Cameroon (Figs 1 and 3). The earliest introductions occurred 
between the late 1970s and the middle 1980s and gave rise to at 
least three local Cameroonian clades; one of which was further 
disseminated to Gabon, Equatorial Guinea, Senegal and Angola. 

We next quantified the viral flux between locations using 
Markov jump counts (Fig. 4 and Table SI). Nigeria (16.4), central 
African countries (14.8), and Togo/Ghana (8.3) displayed positive 
net viral migration rates (efflux minus influx), whereas Benin (— 
14.2), Cameroon (—10.1), Gabon/Equatorial Guinea (—8.4), and 
Senegal (—6.8) displayed negative net viral migration fluxes. The 
highest numbers of viral transitions were from Nigeria to Benin 
(8.1), Togo/Ghana (5.5) and Cameroon (4.4), from Central Africa 
to Cameroon (6.1) and Senegal (4.2), and from Togo/Ghana to 
Benin (5.7), Nigeria (4.1) and Cameroon (3.9). The estimated viral 
flux to Gabon/Equatorial Guinea from Cameroon (2.6), Central 
Africa (2.3), Nigeria (2.3) and Togo/Ghana (2.3) was very similar. 

Demographic History of HIV-1 Subtype G African's 
Epidemic 

Estimations of effective population size (jVc) changes over time 
were initially obtained using a Bayesian skyline plot (BSP) 
coalescent model. The BSP analysis of the complete dataset 
suggests that the subtype G African epidemic experienced a fast 
exponential growth during the 1970s and 1980s, followed by a 
more recent stabilization since the early 1990s (Fig. 5A). This 



overall growth pattern, however, represents the combined 
population dynamics of the different African subtype G clades 
that are being disseminated within different countries and regions. 
In order to better understand the regional differences in the 
demographic histories of HIV- 1 subtype G African epidemics, the 
Gca, Gwa-i an d Gwa-ii clades were analyzed separately (Table 
S2). 

The BSP analyses suggest that all African subtype G clades 
displayed a similar population growth pattern characterized by an 
initial phase of exponential growth followed by a decline in growth 
rate since the early/middle 1990s (Figs. 5B, D and F). To estimate 
the mean epidemic growth rate of the major subtype G African 
clades, log ML for the logistic, exponential and expansion growth 
models were calculated using both PS and SS methods. The best- 
fit demographic model for all subtype G clades was the logistic one 
(log BF>5) (Table S3) that was then used to estimate the initial 
epidemic growth rate. The overall time-scale and demographic 
pattern obtained from both BSP (Figs. 5B, D and F) and logistic 
growth coalescent tree priors (Figs. 5C, E and G) were very similar 
and important differences in the epidemic growth rate were 
detected across subtype G clades from West and Central Africa. 
According to the logistic growth coalescent model, the mean 
growth rate of clades Gwa-i (0.75 year ') and G W a-ii (0.95 year ) 
was about two times higher than that estimated for the clade Gca 
(0.47 year" 1 ) (Fig. 5). 

Discussion 

This study indicates that the HIV-1 subtype G likely originated 
in Central Africa around the late 1960s. The root position of the 
subtype G clade is fully consistent with the most accepted model 
that traces the origin of all HIV- 1 group M subtypes to the DRC 
[54,55,56,57] and is also resistant to the problem of sampling bias 
because sequences from Central Africa represent a minor fraction 
(9.2%) of the total subtype G sequences included in our study. The 
Tmrca of subtype G clade here estimated (1968: 1956-1976) is 
also fully consistent to that previously estimated for this subtype 
(1970: 1960-1978) [58]. This onset date is comparable to that 
estimated for subtype F (1967: 1956-1976) [59]; but more recent 
than that of subtypes A 1 (1954: 1940-1968), C (1955: 1934-1972), 
and D (1947: 1938-1955) [58]. 

After emerging in Central Africa around the late 1960s, the 
HIV-1 subtype G was disseminated to West and West Central 
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WA-I 



Figure 1. Time-scaled Bayesian MCC tree of the HIV-1 subtype G po/PR/RT sequences (~1,000 nt) circulating in West and Central 
Africa. Branches are colored according to the most probable location state of their descendent nodes as indicated at the legend (bottom left). Arcs 
indicate the positions of major subtype G clades characteristic of western (G WA .| and G WA . M ) and central (G C a) African regions. Asterisks point to key 
nodes with high posterior state probability support (PSP>0.85). Branch lengths are drawn to scale of years. The tree was automatically rooted under 
the assumption of a relaxed molecular clock. 
doi:1 0.1 371 /journal.pone.0098908.g001 



Africa a few years later (1975-1980). Our phylogeographic 
analysis supports the occurrence of multiple introductions of 
HIV-1 subtype G strains from central into the western and west 
central African regions. Some of the viral strains disseminated 
during the 1970s fueled secondary outbreaks that led to the origin 
of specific subtype G clades. The major subtype G clades detected 
in our study were the G w a-i that most probably emerged in 
Nigeria around the middle 1970s, and the Gwa-ii ma t most 
probably emerged in Togo or Ghana around the late 1970s. 
Although we grouped sequences from Togo and Ghana into one 
single location, the much higher prevalence of subtype G in Togo 
(9%) [16,17] compared with Ghana (<1%) [23,24] suggests that 



the Gwa-ii clade probably arose in Togo. We also detected three 
minor subtype G clades that resulted of independent introductions 
of viral strains from central Africa into Cameroon between the late 
1970s and the middle 1980s. 

Nigeria and Togo/Nigeria were inferred as the most important 
epicenters of dissemination of the G WA -i and G WA .n clades at 
regional level, respectively. The G WA -i clade, which corresponds to 
the clade previously designated G' [5,8], was the predominant 
subtype G lineage in Nigeria (80%), Senegal (50%), and Benin 
(47%), and also comprises a significant fraction of subtype G 
infections in Gabon/Equatorial Guinea (20%), Cameroon (13%) 
and Togo/Ghana (9%). The Gwa-ii clade predominates in Togo/ 
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Benin 




Cameroon 
13% 




(N = 31) 



Togo/Ghana 

(N = 21) 



Gabon/Equatorial Guinea 

(N = 10) 



Angola/DRC/Rep. of Congo 

(N = 33) 



»WA-I 



'WA-II 



'CA 



Figure 2. Prevalence of G WA -i, G WA -m and G C a clades among subtype G infected individuals from different African countries, 
estimated from phylogenetic analyses presented in Figs. 1 and SI. The total number of subtype G sequences analyzed In each locality Is 
indicated. Each clade is represented by a color as indicated at the legend. 
doi:1 0.1 371 /journal.pone.0098908.g002 



Ghana (86%) and is responsible for a significant fraction of subtype 
G infections in Benin (40%), Gabon/Equatorial Guinea (30%), 
Nigeria (20%), Cameroon (16%) and Senegal (8%). The subtype G 
clades introduced into Cameroon were mainly disseminated to the 
neighboring countries in the central west region (Gabon and 
Equatorial Guinea), although a few disseminations to Senegal were 
also detected. These results indicate that founder subtype G strains 
introduced into Nigeria and Togo have been much more 
efficiently disseminated at regional level than those introduced 
into Cameroon. 

Our demographic reconstructions also revealed another impor- 
tant difference between African subtype G clades mainly 
disseminated in the western region (G WA _! and G WA _ n ) and those 



mainly disseminated in the west central and central regions (Gca). 
Although all African subtype G clades displayed a similar 
population growth pattern characterized by an initial phase of 
exponential growth followed by a decline in growth rate since the 
early /middle 1990s; the mean epidemic growth rate of G WA _i 
(0.75 year ') and Gwa-ii (0.95 year ) clades was about two times 
higher than that estimated for Gca (0.47 year -1 ) clades. This 
suggests that subtype G clades introduced into Nigeria and Togo 
during the 1970s probably encountered more favorable conditions 
for local and regional expansion than those disseminated within 
central and west-central African countries around the same time. 
The median growth rates of the Gwa-i an d G W a-ii clades were 
comparable to that estimated for the CRE06_cpx in western 
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1990-1999 



SN 



TG/GH 



GA/GQ AO/CD/CG 



Figure 3. Spatiotemporal dynamics of HIV-1 subtype G clade dissemination in West and Central Africa. Snapshots of viral migration 
events occurring at different time intervals between 1970 and 2012 are shown. Lines between locations represent branches in the Bayesian MCC tree 
along which location transitions occur. Each location is represented by a color as indicated at the legend. SN: Senegal; TG/GH: Togo/Ghana; BJ: Benin; 
NG: Nigeria; CM: Cameroon; GA/GQ: Gabon/Equatorial Guinea; AO/CD/CG: Angola/DRC/Republic of Congo. 
doi:1 0.1 371 /journal.pone.0098908.g003 



Africa (0.82 year ) [31]; whereas the median growth rate of the 
Gca clades was roughly similar to that estimated for subtype G in 
Cuba (0.54 year ) [60] and higher than that estimated for HIV-1 
group M in Democratic Republic of Congo (0.17 year -1 ) [61]. 

The faster epidemic growth and the broader geographic 
dissemination of subtype G strains introduced into West Africa 
compared with those circulating in the central west and central 
African regions could be associated to clade-specific or regional- 
specific differences in viral transmissibility. It has been suggested 
that accessibility between locations have played a major role in the 
spatial spread of HIV-1 in sub-Saharan Africa [2]. Notably, West 
Africa is one of the most strongly connected regions in the 
continent [2] and also displays an intra-regional migration rate 
(3%) above the African average (2%) [3]. Others factors including 
urbanization [56,62], iatrogenic interventions [63,64], and forced 



migration [62,65] might have also played a role in the emergence 
and spread of HIV in Africa. Such alternative scenarios can now 
be tested in a Bayesian framework [66] to find the hypothesis that 
best explain the variability in the rate of HIV spread across 
African regions. 

Despite the strong regional accessibility, the prevalence of 
subtype G and CRF06_cpx clades greatly vary across western 
African countries. The clades CRF06_cpx, G W a-i, an d Gwa-ii 
seem to have experienced very similar dissemination dynamics; 
although their origin was traced to different western African 
countries (Burkina Faso, Nigeria and Togo, respectively) [31]. The 
three HIV-1 clades probably started to spread in West Africa 
around the same time (1975-1980), expanded during the 1980s 
with similar epidemic growth rates (0.75-0.95 year -1 ), started to 
stabilize around the early /middle 1990s, and their prevalence is 
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Figure 4. Viral migration rates among locations as measured using 'Markov jump' counts. Each panel represents the estimated viral 
exchanges from and to Angola/DRC/Republic of Congo (A), Cameroon (B), Gabon/Equatorial Guinea (C), Benin (D), Togo/Ghana (E), Nigeria (F), and 
Senegal (G). The width of the arrows is proportional to the corresponding mean estimated number of viral transitions between locations according to 
the following scale: thin arrows = 1 .0-2.9 transitions, medium arrows = 3.0-5.9 transitions, thick arrows = 6.0-8.9 transitions. No arrows were displayed 
when the mean estimated number of transitions was below one. 
doi:1 0.1 371 /journal.pone.0098908.g004 
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H 

1965 



G: BSP 



1989 1999 
Year 




r = 0.47 (0.33-0.S3) 




LG 



1995 
Year 



r=0.75 (0.56-0.98) 




U 
1975 



WA-I 



: BSP 



1995 
Year 




I- 
1975 




'WA-II' 



BSP 



1995 

Year 



r=0.95 (0.68-1.30) 



1- 
1975 



'WA-II 



LG 



1995 
Year 



Figure 5. Demographic history of the HIV-1 subtype G and the clades G CA , G WA _, and G WA _ M circulating in Central and West Africa. 

Effective number of infections (y-axis; Iog10 scale) through time (x-axis; calendar years) estimated using Bayesian skyline (A, B, D, F) and logistic (C, E, 
G) growth coalescent model. Median estimates of the effective number of infections (solid line) and 95% HPD intervals of the estimates (dashed lines) 
are shown in each graphic. The median growth rate (with the corresponding 95% credibility interval in parenthesis) of each clade estimated under 
the logistic growth model is indicated in the upper left corner. 
doi:1 0.1 371 /journal.pone.0098908.g005 



greatly reduced as we moved away from the corresponding 
epicenters [31]. The relatively late spread of subtype G and 
CRF06_cpx clades in West Africa combined with: 1) stabilization 
of the HIV epidemic in several western African countries since the 



early/middle 1990s, and/or 2) depletion of the susceptible 
populations most at risk by the firstly introduced CRF02_AG 
lineage, may have limited the dissemination of these viral clades 



PLOS ONE | www.plosone.org 



8 



June 2014 | Volume 9 | Issue 6 | e98908 



Phylodynamics of the HIV-1 Subtype G 



far from the epicenter, thus generating a heterogeneous spatial 
distribution. 

The most important limitation of our study was the small 
sampling size of many African countries. Only Nigeria (n= 183) 
and Cameroon (k = 31) were represented by a high or relatively 
high number of sequences. Other western (Benin, Niger, and 
Togo) and central (Central African Republic, Chad, Equatorial 
Guinea, and Gabon) African countries with circulation of subtype 
G at significant levels (S5% of all HIV-1 infections) 
[13,14,15,16,17,67,68,69,70,71,72] were represented by a small 
number of sequences (nSl5) that may not fully reflect the 
country's subtype G diversity, or were not represented at all in our 
study (Fig. S2). Thus, a more comprehensive and balanced 
sampling from countries poorly or not represented here would 
certainly provide more precise estimates of the relative prevalence 
and migration routes of clades Gwaj, G WA _n and Gca across 
different African regions, and may also result in the identification 
of new regional viral clades not detected in this study. 

It will be also interesting to trace the origins and global dispersal 
pathways of those subtype G lineages found in countries outside 
sub-Saharan Africa, particularly in Cuba [73,74,75], Portugal 
[76,77,78], and Russia [79] where this subtype has been 
disseminated among local populations. It has been showed that 
the spread of HIV-2 outwards Africa mirrors socio historical ties 
[80] and a previous study conducted by our group showed that 
most subtype G Cuban lineages are nested among basal sequences 
from Central Africa [60] . Thus, circulation of subtype G outside 
sub-Saharan Africa may be linked to the presence of Portuguese, 
Cuban, and Russian personnel in Angola and neighboring 
countries during 1960-1990. 

In summary, this study suggests that the HIV- 1 subtype G clade 
started to circulate in Central Africa around the late 1960s and 
was disseminated to West and West Central Africa from the 
middle 1970s onwards. Nigeria and Togo were pointed out as the 
major secondary hubs of dissemination of subtype G within 
western and west central African regions. Our data also highlight 
that the spatiotemporal dissemination dynamics of western African 
subtype G clades were very similar to that estimated for the 
CRF06_cpx epidemic; supporting the notion that current distri- 
bution of major HIV-1 clades in West Africa may have been 
shaped by the same ecological factors. Despite some study 
limitations, these findings offer important insights toward an 
understanding of the current characteristics and dynamics of the 
HIV-1 epidemic in West and West Central Africa. 
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